(navigation image)
Home Wayback Machine | Archive-It | Blog | Heritrix
Search: Advanced Search
Anonymous User (login or join us)
Upload

Download item

[item image]

Play / Download (help[help])


All Files: HTTPS

Resources

Bookmark

CuilCuil crawldata 20070708135553 to 20070708150615 (2007)

Crawl data captured by Cuil from Sun Jul 08 13:55:53 UTC 2007 to Sun Jul 08 15:06:15 UTC 2007


This item is part of the collection: Cuil Crawl Data

Mediatype: web
Identifier: cuil1-domainshard-corpus5-large-merge-rev1.00143-of-25000
Addeddate: 2013-03-27 18:28:13
Publicdate: 2013-03-27 20:17:19
Date: 2007
Firstfiledate: 20070708145628
Lastfiledate: 20070708145628
Scandate: 20070708145628
Bad_records: 86240
Good_records: 4188358
Contributor: Cuil
Creator: Cuil
Identifier-access: http://archive.org/details/cuil1-domainshard-corpus5-large-merge-rev1.00143-of-25000
Numwarcs: 1
Sponsor: Cuil
Imagecount: 4188359
Keywords: crawldata


Individual Files

Information FormatSize
cuil1-domainshard-corpus5-large-merge-rev1.00143-of-25000_files.xml Metadata [file]
cuil1-domainshard-corpus5-large-merge-rev1.00143-of-25000_meta.xml Metadata 1.1 KB

Be the first to write a review
Downloaded 23 times
Reviews


Terms of Use (10 Mar 2001)